Linguistic & Paralinguistic Phonetic Variation in Speaker Recognition & Text-to-Speech Synthesis
نویسنده
چکیده
Phonetic variation, and especially prosodic variation, which is often paralinguistic in nature has gradually attracted more attention among speech researchers and speech scientists as one of the possible solutions to problems with automatic speaker recognition (ASrR) and text-to-speech synthesis (TTS) systems. This paper presents a brief overview of approaches to phonetic variation in ASrR and TTS, beginning with attempts to classify linguistic and paralinguistic phenomena in speech. Also, some of the problems related to paralinguistic phonetic variation and attempted solutions are discussed.
منابع مشابه
A Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information
Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating them potentially can play an important role in transmitt...
متن کاملPhonetic analyses of word and segment variation using the TIMIT corpus of American english
This paper reports a set of studies of some phonetic characteristics of the American English represented in the TIMIT speech database. First we describe some relevant characteristics of TIMIT, and how we use the non-speech files on the TIMIT CD with a commercial database program. Two studies are then described: one using only the non-audio parts of TIMIT (segmental transcriptions and durations,...
متن کاملStudy on parameters of the variable threshold to detect local speech rate deceleration in Japanese spontaneous conversational speech
1. Introduction In human communication, speech conveys not only linguistic information but also emphasis, intention, attitude and so on. They are called paralinguistic information [1]. There are several researches on paralinguistic information [2,3]. Methods for modeling or detecting of paralinguistic information is useful for various application in man-machine communication such as speech synt...
متن کاملProceedings of Meetings on Acoustics
India possesses a large variety of languages and dialects spoken in different parts of the country. These languages possess some unique linguistic, phonological and phonetic properties different from European languages. Research is being done in several of Indian languages such as Hindi, Bangla, etc. to study the articulatory, acoustic, Phonetic and prosodic nature for the purpose of creating s...
متن کاملLinguistic Processor Training on Speaker Data for Unit Selection Text-to-Speech
This paper describes an approach to synthesizing personalized speech while maintaining not only speaker voice but also speaker pronunciation peculiarities. Personalization is realized by means of pronunciation models trained on speaker data contained in his/her speech database. Untrained models allow to synthesize speech in neutral normative style. On the segmental level, the transcription mode...
متن کامل